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Abstract. We characterize the compatibility of a collection of unrooted phylogenetic 
trees as a question of determining whether a graph derived from these trees — the display 
graph — has a specific kind of triangulation, which we call legal. Our result is a counterpart 
to the well known triangulation-based characterization of the compatibility of undirected 
multi-state characters. 



1. Introduction 

A phylogenetic tree or phylogeny is an unrooted tree T whose leaves are in one-to- 
one correspondence with a set of labels (taxa) C{T). If C{T) = X, we say that T is a 
phylogenetic tree for X, or a phylogenetic X-tree fSJ. A phylogenetic tree represents the 
evolutionary history of a set of species, which are the labels of the tree. 

Suppose r is a phylogenetic tree. Given a subset Y C C{T), the subtree ofT induced 
by Y, denoted T|F, is the tree obtained by forming the minimal subgraph of T connecting 
the leaves with labels in Y and then suppressing vertices of degree two. Let T' be some 
other phylogenetic tree such that C{T') C C{T). We say that T displays T' if T' can be 
obtained by contracting edges in the subtree of T induced by L{T'). 

A profile is a tuple V — (Ti, T2, . . . , T^), where each Ti is a phylogenetic tree for some 
set of labels C{Ti). The T^s are called input trees, and we may have C{Ti) n C{Tj) ^ 
for i ^ j- A supertree for P is a phylogeny T with C{T) — IJiLi ^{Ti)- Profile V is 
compatible if there exists a supertree T for V that displays Ti, for each i e {1, . . . , k}. 
The phylogenetic tree compatibility problem asks, given a profile V, whether or not V is 
compatible. This question arises when trying to assemble a collection of phylogenies for 
different sets of species into a single phylogeny (a supertree) for all the species \A\. The 
phylogenetic tree compatibility problem asks whether or not it is possible to do so via a 
supertree that does not conflict with any input tree. 

Phylogenetic tree compatibility is NP-complete |9| (but the problem is polynomially- 
solvable for rooted trees Nevertheless, Bryant and Lagergren have shown that the 
problem is fixed-parameter tractable for fixed k 1 2 1 . Their argument relies on a partial char- 
acterization of compatibility in terms of tree-decompositions and tree-width of a structure 
that they call the "display graph" of a profile (this graph is defined in Section[3]i. Here we 
build on their argument to produce a complete characterization of compatibility in terms 
of the existence of a special kind of triangulation of the display graph. These legal triangu- 
lations (defined in Section[3l) only allow certain kinds of edges to be added. Our result is a 
counterpart to the well-known characterization of character compatibility in terms of trian- 
gulations of a class of intersection graphs |3|, which has algorithmic consequences [SIQ. 
Our characterization of tree compatibility may have analogous implications. 
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2. Preliminaries 

Let G be a graph. We write V{G) and E{G) to denote the vertex set and edge set of G, 
respectively. Suppose C is a cycle in G. A chord in C is any edge of G whose endpoints 
are two nodes that are not adjacent in C. G is said to be chordal if and only if it every 
cycle of length at least four has a chord. A graph G" is a chordal fill-in or triangulation of 
G if V{G') = V{G), E{G') D E{G), and G' is chordal. The set E{G') \ E{G) is called 
a fill-in for G and the edges in it are called^//-/n edges. 

A tree decomposition for a graph G is a pair (T, B), where T is a tree and B is a 
mapping from V{T) to subsets of V{G) that satisfies the following three properties. 

(TDl) {Vertex Coverage) For every v £ V{G) there is an a; e y{T) such that v G B{x). 
(TD2) {Edge Coverage) For every edge {w, v} G E{G) there exists an a; e y{T) such 
that {u, v} C B{x). 

(TD3) {Coherence) For every u G V{G) the set of vertices {x G V{T) : u e B{x)} 
forms a subtree of T. 

It is well known that if G is chordal, G has a tree-decomposition (T, B) where (i) there 
is a one-to-one mapping G from the vertices of T to the maximal cliques of G and (ii) 
for each vertex x in T, B{x) consists precisely of the vertices in the clique C{x) |6|. 
This sort of tree decomposition is called a clique tree for G. Conversely, let (T, B) be 
a tree decomposition of a graph G and let F be the set of all {u, w} ^ ^'(G) such that 
{m, u} C 5(2;) for some x G ^^(T). Then, is a chordal fill-in for G f6l. We shall refer 
to this set F as the chordal fill-in of G associated with tree-decomposition (T, B) and to 
the graph G' obtained by adding the edges of F to G as the triangulation of G associated 
with {T,B). 

3. Legal Triangulations and Compatibility 

The display graph of a profile V = (Ti, . . . , Tk) is the graph G = G{V) formed from 
the disjoint graph union of Ti, . . . , by identifying the leaves with common labels (see 
Fig. 1 of |2|). An edge e of G is internal if, in the input tree where it originated, both 
endpoints of e were internal vertices; otherwise, e is non internal. A vertex u of G is called 
a leaf if it was obtained by identifying input tree leaf nodes with the same label i. The 
label of V is £. A non-leaf vertex of G is said to be internal. 

A triangulation G' of the display graph G is legal if it satisfies the following conditions. 

(LTl) Suppose a clique in G' contains an internal edge. Then, this clique can contain no 

other edge from G (internal or non internal). 
(LT2) Fill-in edges can only have internal vertices as their endpoints. 

Note that the above conditions rule out a chord between vertices of the same tree. Also, 
in any legal triangulation of G, any clique that contains a non internal edge cannot contain 
an internal edge from any tree. 

The importance of legal triangulations derives from the next results, which are proved 
in the next section. 

Lemma 1. Suppose a profile V = (Ti, . . . , Tk) of unrooted phylogenetic trees is compat- 
ible. Then the display graph ofV has a legal triangulation. 

Lemma 2. Suppose the display graph of a profile V — (Ti, . . . , T^) of unrooted trees has 
a legal triangulation. Then V is compatible. 

The preceding lemmas immediately imply our main result. 
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Theorem 1. A profile V ~ (Ti, . . . , Tk) of unmoted trees is compatible if and only if the 
display graph ofV has a legal triangulation. 

4. Proofs 

The proofs of Lemmas [T] and |2]rely on a new concept. Suppose Ti and T2 are phyloge- 
netic trees such that C{T2) C C{Ti). An embedding function from Ti to T2 is a surjective 
map (f) from a subgraph of Ti to T2 satisfying the following properties. 

(EFl) (p maps labeled vertices to vertices with the same label. 
(EF2) For every vertex v of T2 the set (t>^^{v) is a connected subgraph of Ti . 
(EF3) For every edge {u, v} of T2 there is a unique edge {u', v'} in Ti such that (t>{u') = u 
and (t){v') = V. 

The next result extends Lemma 1 of 121 . 

Lemma 3. Let Ti and T2 be phylogenetic trees and C{T2) Q C{Ti). Tree Ti displays 
Tree T2 if and only if there exists an embedding function (pfrom Ti to T2. 

Proof. The "only if" part was already observed by Bryant and Lagergren (see Lemma 1 
of ill). We now prove the other direction. 

To prove that Ti displays T2, we argue that T2 can be obtained from Ti\C{T2) by a 
series of edge contractions, which are determined by the embedding function (j> from Ti to 
T2. Let Tl be the graph obtained from ri|£(r2) by considering each vertex v of T2 and 
identifying all vertices of (j>^^{v) in Ti\C{T2) to obtain a single vertex u' with (j){u') = v. 
By property (EF2), each such step yields a tree. By properties (EF1)-(EF3), each vertex v 
of Ti \C{T2) is in the domain of <f>. Thus, function cj) is now a bijection between T2 and T{ 
that satisfies (EF1)-(EF3). 

We claim that for any two vertices u,v ^ V(T2), there is an edge {u, v} G E{T2) if 
and only if there is an edge <j)^^{v)} G E{T[). The "only if" part follows from 

property (EF3). For the other direction, assume by way of contradiction that {x, ^ 
E{T2), but that {(j)^'^{x),4)^''-{y)} G E{Tl). Let P be the path between vertices x and y 
in T2. By property (EF3), there is a path between nodes (l)^^{x), (t)^^{y) in tree T[ that does 
not include the edge {(/)~^(a;), This path along with the edge {(f)^^ [x) , (p^^ (y)} 

forms a cycle in T[, which gives the desired contradiction. 

Thus, the bijection between T2 and T[ is actually an isomorphism between the two 
trees. It now follows from property (EFl) that Ti displays T2. □ 

The preceding lemma immediately implies the following characterization of compati- 
bility. 

Lemma 4. Profile V = (Ti, . . . ,Tk) is compatible if and only if there exist a supertree T 
for V and functions (fii, . . . , (j)k, where, for i = I, . . . ,k, (pi is an embedding function from 
T to Ti. 

Proof of LemmaUl If P is compatible, there exists a supertree for V that displays Ti for 
i — 1, . . . , k. Let T be any such supertree. By LemmalU for i = 1, . . . , k, there exists an 
embedding function (pi from T to Ti. We will use T and the (piS to build a tree decompo- 
sition (Tg, B) corresponding to a legal triangulation G' of the display graph G of P. The 
construction closely follows that given by Bryant and Lagergren in their proof of Theorem 
1 of (|2j; thus, we only summarize the main ideas. 

Initially we set Tg = T and, for every t; G V{T), B{v) — {(j>i{v) : u in the domain of 0^ ; 1 < 
i < k}. Now, {To, B) satisfies the vertex coverage property and the coherence property. 
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but not edge coverage 12|. To obtain a pair (Tg, B) that satisfies all three properties, sub- 
divide the edges of Tq and extend B to the new vertices. Do the following for each edge 
{x, y} of To- Let F = {{ui, vi}, . . . , {um, Wm}} be set of edges of G such that Ui £ B{x) 
and Vi e B{y). Observe that F contains at most one edge from Ti, for i — 1, . . . , k (thus, 
TO < k). Replace edge {x, y} by a path x, zi, . . . , Zm,y, where zi, . . . , z^. are new ver- 
tices. For i = 1, 2, . . . , TO, let B{zi) = {B{x) n B{y)) U {wi, . . . , u^, Wi, . . . , Um}- The 
resulting pair (Tg, B) can be shown to be a tree decomposition of G of width fc (see |2]). 
The preceding construction guarantees that (Tg, B) satisfies two additional properties: 

(i) For any x £ V{Tg), if B{x) contains both endpoints of an internal edge of Ti, for 
some i, then B{x) cannot contain both endpoints of any other edge, internal or not. 

(ii) Let X e V(Tg) be such that B{x) contains a labeled vertex v e V{G). Then, for 
every u & B{x) \ {v}, {v, u} e E{G). 

Properties (i) and (ii) imply that the triangulation of G associated with {Tq, B) is legal. 

□ 

Next, we prove Lemma |2] For this, we need some definitions and auxiliary results. 
Assume that the display graph of profile V has a legal triangulation G' . Let (T', B) be 
a clique tree for G". For each vertex v e V{G), let N{v) denote the set of all nodes in 
the clique tree T' that contain v. Observe that the coherence property implies that N{v) 
induces a subtree of T' . 

Lemma 5. Suppose vertex v is a leaf in tree Ti, for some i G {1, . . . , k}. Let U{v) — 
V}x£N(v) B{x). Then, for any j e {1, . . . , k}, at most one internal vertex ufrom input tree 
Tj is present in U{v). Furthermore, for any such a vertex u we must have that {u, v} G 
E{G). 

Proof. Follows from condition (LT2). □ 

Lemma 6. Suppose e — {u, u} is an internal edge from input tree Ti, for some i G 
{1, . . . , k}. Let U{e) = U:Ee{w(n)nJV(t,)} B{x). Then, 

(i) U{e) contains at most one vertex of Tj, for any j G {l,...,fc}, j ^ i, and 

(ii) V{T,)nUie) = {u,v}. 

Proof. Part (ii) follows from condition (LTl). We now prove part (i). 

Assume by way of contradiction that the claim is false. Then, there exists a j ^ i and 
an edge {x, y} G T' such that e C B{x), e C B{y), and there are vertices a,b ^ V{Tj), 
a ^ b, such that a G B{x) and b G B{y). 

Deletion of edge {x, y} partitions V{T') into two sets X and Y. Let P = {a e V{Tj) : 
a G B{z) for some z e X} and Q = {6 G V{Tj) : b G B{z) for some z G Y}. By the 
coherence property, (P, Q) is a partition of V{Tj). There must be a vertex p in set P and 
a vertex q in set Q such that {p, q\ G EiTf). Since G" is a legal triangulation, there must 
be a node z in T' such that p,q ^ B{z). Irrespective of whether z is in set X or Y, the 
coherence property is violated, a contradiction. □ 

A legal triangulation of the display graph of a profile is concise if 
(CI) each internal edge is contained in exactly one maximal clique in the triangulation and 
(C2) every vertex that is a leaf in some tree is contained in exactly one maximal clique of 
the triangulation. 

Lemma 7. Let G be the display graph of a profile V. If G has a legal triangulation, then 
G has a concise legal triangulation. 
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Proof. Let G" be a legal triangulation of the display graph G of profile V that is not concise. 
Let {T',B) be a clique tree for G' . We will build a concise legal triangulation for G 
by repeatedly applying contraction operations on {T',B). The contraction of an edge 
e = {x, y} in T' is the operation that consists of (i) replacing x and j/ by a single (new) 
node z, (ii) adding edges from node z to every neighbor of x and y, and (iii) making 
B{z) — B{x) U B{y). Note that the resulting pair (T', B) is a tree decomposition for G 
(and G'); however, it is not guaranteed to be a clique tree for G' . 

We proceed in two steps. First, for every leaf w of G such that \N{v)\ > 1, contract 
each edge e = {x, y} in T' such that x,y G N{v). In the second step, we consider each 
edge e — {u, v} of G such that \N{u) n -^^(f )| > 1, contract each edge {x, y} in T' such 
that y G N{u) n N{v). Lemma|5] (respectively, Lemma|6ll ensures that each contraction 
done in the first (respectively, second) step leaves us with a new tree decomposition whose 
associated triangulation is legal. Furthermore, the triangulation associated with the final 
tree decomposition is concise. □ 

Proof of Lemma]2\ We will show that, given a legal triangulation G' of the display graph 
G of profile V, we can generate a supertree T for V along with an embedding function 
from T to Ti, for i = 1, . . . , fc. By Lemma]?] this immediately implies that V is compatible 
By Lemma]?] we can assume that G' is concise. Let (T'. B) be a clique tree for G'. 
Initially, we make T = T'. Next, for each node x of T, we consider three possibilities: 

Case 1: B{x) contains a labeled vertex v of G. Then, w is a leaf in some input tree Ti, 
further, by conciseness, x is the unique node in T such that v € B{x), and, by the 
edge coverage property, if u is the neighbor of v in Ti, u € B(x). Now, do the 
following. 

(i) Add a new node Xy and a new edge {x, Xy} to T. 

(ii) Label Xy with I, where I is the label of v. 

(iii) For each i G {1, . . . , fc} such that u is a leaf in Ti, make cj)i{xy) — v and 
4)i{x) = u, where u is the neighbor of v in T^. 

Case 2; -B(a;) contains both endpoints of an internal edge e of some input tree Ti. By 
legality, B{x) does not contain both endpoints of any other edge of any input tree, 
and, by conciseness, x is the only node of T that contains both endpoints of e. 
Now, do the following. 

(i) Replace node x with nodes x^ and Xy, and add edge {xy , Xy}. 

(ii) Add an edge between node Xy and every node neighbor y of x such that 
u e B{y). 

(iii) Add an edge between node Xy and every neighbor y of a; such that v G B{y). 

(iv) For each neighbor y of x such that u ^ B{y) and v ^ B{y), add an edge 
from y to node x^ or node Xy, but not to both (the choice of which edge to 
add is arbitrary). 

(v) Make (i>i{xu) = u and (i>i{xy) = v. 

Case 3; B{x) contains at most one internal vertex from Ti far i G {1, . . . , fc}. Then, for 
every i such that B{x) D V{Ti) ^ make (j>i{x) = v, where v is the vertex of Ti 
contained in B{x). 

By construction (Case 1) and the legality and conciseness of {T' ,B), for every t G 
[Ji^i C-{Ti) there is exactly one leaf x G V{T) that is labeled I. Thus, T is a supertree of 
profile V. Legality also ensures that the function (pi is a surjective map from a subgraph 
of T to Ti. Furthermore, the handling of Case 1 guarantees that 0; satisfies (EFl). The 
coherence of (T', B) and ensures that (f)i satisfies (EF2). The handling of Case 2 and 
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conciseness ensure that (pi satisfies (EF3). Thus, (pi is an embedding function, and, by 
LemmalU profile V is compatible. □ 
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